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Abstract 

This  paper  considers  the  problem  of  making  statistical  inferences 
about  group  judgments  and  group  decisions  using  Qualitative  Controlled 
Feedback,  from  the  Bayesian  point  of  view.  The  qualitative  controlled 
feedback  procedure  was  first  introduced  by  Press  in  1978  (JASA),  for  a 
single  question  of  interest.  The  procedure  is  first  reviewed  here 
including  the  extension  of  the  model  to  the  multiple  question  case.  We 
develop  a  model  for  responses  of  the  panel  on  each  stage.  Many  questions 
are  treated  simultaneously  and  an  autoregressive  model  is  developed  for 
explaining  the  responses  of  the  group  members  as  a  function  of  the  feed¬ 
back.  The  errors  are  assumed  to  follow  a  matrix  intraclass  covariance 
structure.  Marginal  and  conditional  posterior  distributions  of  the 
regression  coefficient  vector  are  found  in  both  small  and  large  samples. 

The  broadly  defined  generic  family  of  multidimensional  Student-t  distri¬ 
butions  is  found  to  play  a  major  role  in  the  results. 
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1 .  Introduction 


Group  judgment  formulation  and  decision  making  using  qualitative 
controlled  feedback  (QCF)  was  introduced  in  Press,  1978.  The  work  was 
extended  to  the  multivariate  case  of  many  questions  in  Press  (1979a). 

In  this  paper  we  carry  the  work  further  by  adopting  the  Bayesian  point 
of  view  and  developing  the  posterior  distribution  of  the  coefficient 
vector  that  relates  individual  responses  of  group  members  to  explanatory 
variabl  es. 

The  methodology  was  originally  conceived  in  order  to  study  how  the  U.S. 

Air  Force  might  be  reorganized.  We  will  motivate  the  procedure,  however, 
in  a  different  context. 

Suppose,  for  example,  a  city  planning  bureau  would  like  to  resolve 
some  public  policy  issues  that  are  of  importance  to  the  city  in  various 
ways.  They  would  like  to  determine  how  to  allocate  the  resources  in  their 
budget  so  that  "appropriate"  funding  is  devoted  to  police,  fire,  and  other 
municipal  services,  consistent  with  environmental  considerations,  political 
considerations,  economic  feasibility,  engineering  and  scientific  constraints 
and  perhaps  other  factors  as  well.  These  factors  affect  most  people  in 
some  possibly  indirect  way,  and  no  one  person  is  likely  to  be  knowledgable 
in  all  related  areas. 

■  *  ■  - 

It  is  decided  to  adopt  a  QCF  procedure  to  assist  the  policy  makers  in  _ 
generating  the  factors  that  argue  for  one  allocation  over  another.  A  sample 
of  panelists  is  taken  from  the  city  population  and  the  panel  members  are  each 
given  a  survey  instrument  that  includes  a  battery  of  questions. 

The  survey  Instrument  could  be  administered  by  mail,  by  telephone, 
by  on-line  computer,  or  whatever.  The  data  collection  protocol  of  QCF 
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requires  that  each  panelist  respond  to  the  questions  independently  of  all 
other  panelists,  and  without  any  panelists  knowing  the  identity  of  any  other 
panelists.  Thus,  the  social  pressures  of  face-to-face  confrontation  in  a 
room,  perhaps  at  the  expense  of  logical  reasoning,  are  avoided. 

In  applying  a  QCF  procedure,  each  respondent  is  typically  asked  to 
answer  a  set  of  basic  questions.  In  addition,  the  subject  is  asked  to 
provide  distinct  reasons  for  each  answer  that  will  help  justify  the 
subject's  answers.  He  will  usually  also  be  asked  to  answer  some  sub¬ 
sidiary  questions  that  will  serve  to  provide  demographic  and  attitudinal 
information  about  the  degree  of  expertise  of  the  subject,  his  likely 
institutional  biases,  etc. 

An  intermediary  is  asked  to  collect  all  the  answers.  This  person  then 
forms  a  merged  composite  of  the  reasons  provided  by  the  panel  for  the  answer 
to  each  question  asked.  This  merging  can  be  carried  out  with  the  aid  of  a 
computer  editor.  That  is,  in  some  situations  this  step  may  be  carried  out 
mechanically  (if  most  reasons  are  listed  in  advance s  panelists  can  check 
them  off  and  a  computer  can  tally  them).  Reasons  can  be  coded  and  classi¬ 
fied  into  some  intrinsically  orthogonal  set  (many  reasons  are  probably  just 
paraphrases  of  one  another).  The  end  product  generated  is  a  composite  of 
reasons  corresponding  to  each  pair  of  questions  and  answers. 

The  composites  of  reasons  are  now  presented  to  each  panelist  in  a  simple 
form  (such  as  a  checklist).  Each  panelist  is  then  asked  to  answer  the  same 
set  of  questions  a  second  time,  only  now,  the  panelist  is  exposed  to  the 
reasoning  used  by  all  other  panelists.  The  numerical  responses  given  by  the 
other  panelists  is  not  provided  for  any  of  the  subjects,  nor  do  they  receive 
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any  other  data,  such  as  sample  group  mean  vectors.  The  composites  of 
reasons  are  the  only  data  fed  back.  As  a  result,  the  second  stage  response 
of  a  panelist  is  likely  to  differ  from  his  first  stage  response  only  because 
he  feels  he  has  ignored  some  arguments  used  by  other  panelists.  Note  that 
panelists  are  not  told  the  proportion  of  panelists  who  gave  a  particular 
reason;  a  panelist  does  not  have  any  basis  for  deciding  how  much  to  weight 
each  reason,  in  his  own  thinking,  other  than  by  adopting  his  own  weighting 
system  according  to  his  own  perceptions  of  value  and  importance. 

This  procedure  is  repeated  until  the  process  stabilizes,  in  the  sense 
that  respondents  are  not  changing  their  responses  very  much  from  stage  to 
stage. 

There  is  room,  however,  for  manipulation  of  the  outcome  by  a  devious 
intermediary  who  might  misrepresent  the  composite  fed  back  to  the  panel  on 
each  stage.  This  effect  can  be  minimized  by  using  a  group  of  intermediaries 
to  accomplish  the  task  of  forming  a  composite  of  reasons. 

Earlier  research  involving  group  decision  making  and  judgment  formu¬ 
lation,  and  the  effects  of  social  interaction  pressures,  is  summarized  in 
Press,  1978.  In  Section  2  we  develop  a  model  for  studying  the  relation¬ 
ships  between  responses  to  the  many  questions,  and  the  rationale  the  panel 
feels  is  most  important  to  explain  the  answers.  The  model  can  also  be  used 
for  predicting  the  next  round's  responses  (in  many  situations,  for  economic 
or  other  reasons,  it  may  be  difficult  or  undesirable  to  carry  out  the  process 
for  one  more  stage). 

The  multiple  question  model  is  treated  In  greater  detail  from  a  sampling 
theory  viewpoint  in  Press  (1979a).  The  methodology  was  applied  to  study  a 
real  problem  in  Press  (1979c).  Section  3  presents  several  distinct  develop¬ 
ments  that  provide  methods  for  making  Bayesian  inferences  useful  for 
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predicting  the  next  round's  responses.  Finally,  Section  4  provides  a 
summary  and  conclusions. 


2.  Multiple  Question  Model 
2.1  First  Stage 

Let  z1-n(j)  denote  the  numerical  response  of  subject  i,  on  stage 
n,  to  question  j;  i  =  1,2,...,N;  j  =  l,2,...,q.  Let  Fn  denote  the 
totality  of  information  obtained  on  stage  n  and  fed  back  to  each 
panelist  at  the  beginning  of  stage  (n+1).  Let  F^  denote  the  n-vector 

( F . ) .  Finally,  let  X:N*r  denote  a  regressor  matrix  of  explanatory  vari- 
J 

ables  observed  for  the  N  panelists  (these  are  answers  to  subsidiary 
questions) . 

For  the  first  stage  model  we  adopt  a  simple  regression  with  uncor¬ 
related  errors  (subjects  respond  independently  on  the  first  stage). 
Accordingly,  assume 


z1 ( j) I x  =  xe(j)  +  u1 (j), 

Et^)  =  0,  var[u.j  ( j )]  =  o*(j)If 


where: 


z,(j)  s  [z11(j),...,zMl(j)]',  u,(j)  =  [u11(j),...,uM1(j)]'  , 


u^(j)  denotes  an  error  term,  and  6 ( j )  denotes  an  rxl  vector  of 
unknown  coefficients.  For  convenience,  take 


V  =  [u  (1) . u.(q)],  V'  =  [v. . vN]  , 

(Nxq)  (Nxl)  (Nxl)  (<?xN)  (qxl)  (qxl) 


and  assume 


♦t  i=j 

0,  i*j 


If 

Z1  =  [z1(l),...,z1(q)],  B  *=  CbO) . B(q)]  . 

(Nxq)  (Nxl)  (Nxl)  (rxq)  *(r*l)  (rxl) 

the  model  may  be  written  in  the  compact  form 

Z1  =  X  B  +  V  4  (1) 

(Nxq)  (Nxr)  (rxq)  (Nxq) 


where: 

E(V)  =  0,  cov(v  ,v.)  =  0,  i^j,  var(v.)  =  4>*. 

I  J  I 

The  model  of  course  represents  a  classical  multivariate  regression.  The 
Gauss-Markov  estimator  of  B  is  therefore 

B  =  (X'X)'1X,Z1  .  (2) 

2.2  Feedback  Stages  (n  >  2) 

For  later  stages,  beyond  the  first,  the  model  must  change.  This  is 
because  the  composites  of  reasons  fed  back  to  each  respondent  cause  their 
responses  to  be  mutually  correlated.  Since  they  all  get  the  same  feedback, 
however,  their  responses  on  the  next  stage  are  likely  to  be  similarly  cor- 
related  (homogeneous,  or  intraclass  correlation  structure).  Moreover, 
their  answers  on  stage  two  are  likely  to  be  related  to  their  answers  on 
stage  one.  Adopt  the  autoregressive  model 
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«i„u> 5 


[z. 

in 


Rn.,(j) 

-  I  c(a)(j)  [1  -  «<“>  (J)j  p'o)(j)  +  u.  (j),  (3) 

^=*1  1  1  ,n- 1  n  *  ** 


where  R^j)  denotes  the  number  of  distinct  reasons  given  by  the  panel 
(this  is  the  number  of  reasons  in  the  composite)  for  the  answer  to  question 

j,  on  stage  n;  {^(j)  is  unity  or  zero,  depending  upon  whether  or  not 
respondent  i  records  reason  a  for  his  answer  to  question  j,  on  stage 
n;  cja^(j)  is  an  unknown  constant  of  proportionality  (to  be  estimated) ; 
and  p^(j)  denotes  the  proportion  of  respondents  who  record  reason  a 

for  question  j,  on  stage  n  (this  will  be  interpreted  as  the  weight  or 
importance  the  panel  gives  to  this  reason).  Note  that  even  though  the  panel 
members  do  not  know  pj|a)(j),  it  can  nevertheless  be  used  in  our  model  since 
the  intermediary  knows  it  or  can  compute  it. 

The  model  in  eqn.  (3)  may  be  interpreted  as  follows. 

A2 ^ n ( J )  represents  the  change  in  response  for  subject  i,  on  question  j, 
from  stage  (n-1)  to  stage  (n).  This  change  results  from  an  incremental 
effect  attributable  to  each  reason  (linear  combination  of  effects).  If  the 
subject  gave  that  reason  on  the  last  stage,  there  is  of  course  no  effect, 
while  if  he  didn't  give  it,  the  effect  is  proportional  to  the  importance 
of  the  reason  (as  measured  by  the  proportion  of  panelists  who  gave  the 
reason) . 

2.3  Error  Structure  (n  a  2) 

Define 


“in  ' 

(q*i) 


* 


u. 

in 


(q)3‘ 


and  assume 


0)  E(u1n)  -  0 

(2)  var(u.  )  *  z 
in  n 

(qxq) 


(3)  cov(u.  ,u .  )  =  | 
in’  jm'  1 


^n.  i7j,n=m  , 

(q*q) 

0,  n^m  . 


For  compactness,  let 


n  1  In 
(Nqxl) 

Then,  E ( )  =  0,  and 

var(u  )  =  fi 
n  n 

(NqxNq) 

n  is  seen  to  be  a  matrix  intraclass  covariance  matrix.  Some  of  its 
n 

properties  are  given,  e.g.,  in  Press,  1972,  pp.  21,  48,  49.  and  in  Press, 
1979b.  The  assumption  of  equal  diagonal  blocks  in  means  we  are  assuming 
multivariate  homoscedasticity.  All  off-diagonal  elements  of  the  qxq  blocks 
are  assumed  to  be  identical  (An).  We  are  therefore  assuming  that  in  many 
situations  it  is  reasonable  to  expect  that  the  panel  will  be  constituted  with 
members  who  are  sufficiently  homogeneous  in  background  so  that  a  pattern  of 
homogeneous  correlation  is  reasonable. 
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2.4  Transformations  to  Canonical  Form 


azin  =  ...«in(q)]’  . 

(qxl ) 


and  assume 


e!o)(j) 


Xi  aa(^  * 

(Nr)  (rxl) 


where  x..  denotes  the  (rxl)  vector  of  explanatory  variables  for  sub¬ 
ject  i,  and  a  (j)  denotes  an  (r*l)  vector  of  unknown  weights.  For 
a 

compactness,  let 


ci  (j) 


CRn_i  ( J)-U 


(1  \  (R  -j  ( J ) ) 

=  [CiP)(j),...,C1  H-1  (j)]*  , 


•‘""’’(J)  *  Caj { j) . a'  <J}( j)3- 

rrRn_,(j).i: 


so  that 


ci  ( j)  =  (I  8  x ! )  a(n_1  }(j)  , 


where  0  denotes  the  direct  product.  Vie  next  combine  all  the  observable 
explanatory  data  into  one  matrix.  Define 

»in(J)  -  U  8  x.)  t.n(j) 


where: 


‘In’fJ)  z  C  -  ‘uWM  z  tt'i’d) . 

[NRnl(j)]  | 
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and  define 


w!  (1)  0 

in 


v  ° 


.  [j(n-1),0) . a{n-’  >■  (q)]1  . 


where: 


(hn-lx,) 


h  =  r  E  R  ,(j)  . 
n-T  }  n- 1 


The  model  now  becomes 


az.  =  W.  a^n_1 ^  +  u. 
in  in  in 

(q*i)  (qxhn1)  ( hn-1  xl )  (qx1) 


Combining  all  subjects,  (5)  becomes 


W  a(n“1}  +  u 

(Nqxl)  (|‘,1qxhn_i  )(hn.-|x1 )  ( Nqx  1 ) 


where : 


AZn  =  (Azin’*-”AZNn^  *  Wn  =  ^Wln’"‘,WNn^ 


interacting  over  the  n  stages  gives 


z  sz-z  =  W-a  +  u 
(Nqxl)  n  1  (Nqxh)  (h*l)  'Nqxl) 


where  for  h  =  Eh., 
j=l  J 


(h*l) 
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Using  eqn.  (21)  in  eqn.  (17),  with  p'(a,n0)  *  p^ffig),  gives  the  joint 
posterior  density 


p(a,flQ|z*,w*)  «  |fl0| 


(2y+Nq+2] 

2 


(-3s)trn~'(H+2J) 


r*P  *(6 ;n ; JRJn~  ) 


The  marginal  posterior  density  of  a  is  found  by  integrating  (22) 
with  respect  to  fig.  The  integration  is  carried  out  by  reference  to 
eqn.  (21),  using  its  normalizing  constant.  The  result  is 


p(a|z*,w*)  « 


1 2  J+(  z*-w*a )  ( z*-w*a ) '  | 


•  (r*+1)Fq*(6.Y-^;n;JRJ(2J+H)_1)  ,  •  (23) 

where:  H  h  (z*-w*a)(z*-w*a)' .  If  we  identify  M  =  2J ,  2y  =  m-Nq-1 ,  and 
take  R  =  0,  it  is  immediately  seen  that  the  result  obtained  in  (18), 
for  the  inverted  Wishart  prior,  is  a  special  case  of  eqn.  (23).  This 
result,  however,  has  the  advantage  of  being  richer  in  parameters  and  can 
therefore  accommodate  a  much  greater  variety  of  types  of  subjective 
information.  Inferences  about  a,  however,  are  more  complicated,  and 
will  require  the  use  of  zonal  polynomial  tables  in  order  to  evaluate  the 
hypergeometric  functions  in  (23)  (see  James,  1974),  The  parameters  of 
the  hypergeometric  functions  are  selected  so  as  to  satisfy  the  block  diagonal 
structure  of  fig. 
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the  elements  of  Jig  that  do  lie  in  the  main  diagonal  blocks  are  simul¬ 
taneously  constrained  in  all  of  their  moments  (by  taking  m  large). 

Such  constraints  may  not  always  be  desirable.  For  the  general  case, 
an  alternative,  prior  for  ftg  which  is  richer  in  parameters  is  recommended. 
Vie  propose  such  a  prior  below. 

Generalized  Prior  Distribution 

A  generalized  family  of  Wishart  type  distributions  was  introduced  by 
Roux,  1971.  The  generalization  includes  hypergeometric  functions  of 
matrix  argument.  A  form  of  the  associated  density  which  widens  the  para¬ 
meter  spaces  is  given  (for  a  general  pds  matrix  X)  by 

.  T-isill 

f(X)  •  c | X |  2  exp(-tr(JX)}  r„F  ,({;n;JRJX)  ,  (20) 

for  X:qxq,  X  >  0,  6  =  (6^ . <5r*)'»  n  =  (n^  •  »n  *) '  ,  J:q*q,  R:qxq, 

J  >  0,  R  >  0,  and  r*F^*(-)  denotes  the  generalized  hypergeometric 
function  of  matrix  argument  (see  Constantine,  1963).  The  normalizing 
constant  is  given  by 


VYV*+i)V*6,Y;n;JR) 


where  T^y)  denotes  a  q-dimensional  gamma  function.  The  parameters 
(6.,n.),  i=l,...,r*,  j=l,...,q*,  are  restricted  to  take  those  values 

*  J 

for  which  f ( X )  is  positive. 

Now  let  JJq  =  X'1,  replace  q  by  Nq  (the  dimension  of  fig),  and 
transform  the  density  in  (20)  to  yield  the  generalized  inverted  Wishart 


density 


p;<V 


Myt 


C ( Nq>TT  - tr( jn0' ) }  r»F  .UjruJROnj1 )  .  (21) 
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Then, 

E^V  =  m-2Nq-2  ’  E*Z2^  =  m-2Nq-2  ’ 

and  E(^q)  =  0  for  all  elements  of  fig  not  in  the  block  diagonal  elements. 

Moreover,  if  n.  =  (tu  for  all  (a, 6)  not  in  the  block  diagonal 

0  aB 

elements , 

m  rn 

,  *  _ aa  BB  _ 

=  Im-2Nq-l)"(m-2Nq-2)(m-2Nq-4y  ’ 

where  M  =  (m^).  Note  that  var(uag)  is  of  order  m  ;  that  is,  var(u)ag) 

2 

goes  to  zero  with  increasing  m  .  We  can  always  choose  m  large  enough 
so  that  all  elements  off  the  block  diagonal  elements  of  fig  are  centered 
at  zero,  with  very  small  variance.  Note  from  eqn.  (19)  that  var(a|z*,w*) 
goes  to  zero  with  increasing  v*  (which  is  linear  in  m).  By  selecting 
(MpM^)  appropriately,  and  choosing  m  sufficiently  large  this  prior 
distribution  will  be  sufficiently  rich  to  accommodate  many  classes  of 
subjective  information. 

This  type  of  prior  is  not  recommended  for  the  general  case,  since 
the  structure  of  the  prior  distribution  is  too  restrictive*  Our  reasoning 
is  that  although  elements  of  fiQ  not  in  the  blocks  on  the  main  diagonal 
are  centered  at  zero  with  arbitrarily  small  variance,  because  there  are 
only  two  parameters  in  the  Inverted  Wishart  distribution,  viz.  (M,m), 


♦The  restrictiveness  of  the  structural  form  of  the  Inverted  Wishart 
distribution  has  already  been  noted  by  Rothenberg,  1963,  in  a  different 
context  (see  References). 


where: 
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a*  =  (w*,M“1w*)“1(w*'H"1z*),  v*  =  m-Nq-h  ; 


Q”!  =  ,  6*  =  1+z**M‘1z*-a*,w*,M'1w*a* 


That  is,  a  follows  an  h-dimensional  Student  t-density  with  mean  a*, 
and  v*  degrees  of  freedom.  Then, 


E(a | z*,w*)  =  a*  =  (w*'M'1w*)"1(w*'M‘1z*)  . 
var(a |z*,w*)  =  Q 

=  (&)  (WH'Vr1  .  (19) 

Discussion  of  Prior 

The  mean  and  variance  of  the  inverted  Wishart  distribution  are  well 
known  (see  e.g.  Press,  1972,  p.  111).  Therefore 


N 

m-2Nq-2 


But  i f  we  subjectively  believe  that 


we  should  take 


19 


and  p' (a ,n)  denotes  the  prior  density.  Note  that  we  are  ignoring  the 
intraclass  structure  of  at  this  point. 

For  the  prior  density,  assume  p'(a,n0)  =  pj  (ajp^iig) ,  and 

pj(a)  «  constant. 


P2(V 


m 

l»nl  ? 


(-ii)tm'  H  >  , 


where  (m,M)  are  assumed  to  be  known  hyperparameters,  M  >  0. 


The  joint  posterior  density  becomes 


p(a.n  |z*,w*)  «  - — -7 X 


exp  |(-%)trft"1(M+H)  J  . 


In  nQ  this  expression  is  the  kernal  of  an  inverted  Wishart  distribution 
so  it  is  readily  integrated  to  give  the  marginal  density 


p(a|z*,w*)  « 


[m-Nq)/2  ’ 


p(a|z*,w*)  « 


jl+(w*a-z*)'M  (w*a-z*) J  2 


Completing  the  square  in  a  gives 


p(alz*,w*) 


jv*+(a-a*)'Q‘1(a-a*)  \  V  2 
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Remark  (2): 

The  large  sample  Bayesian  result  shows  that  the  elements  of  the 
regression  coefficient  vector  a  are,  for  large  N,  jointly  normally 
distributed,  so  that  inferences  about  particular  coefficients  are  readily 
made. 

Remark  (3): 

The  large  sample  8ayesian  result  just  found  is  meaningful  when  the 
number  of  subjects  on  the  panel  is  large;  the  number  of  feedback  stages 
may  still  be  small . 


3.4  Small  Samples 

To  obtain  a  Bayesian  result  useful  in  small  or  moderate  samples  we 
adopt  a  different  point  of  view  than  that  used  in  subsection  3.3.  Our 
approach  now  is  to  first  ignore  the  (possibly)  intraclass  covariance 
structure  in  the  likelihood  function,  but  to  recapture  the  structure  in 
the  prior  distribution. 

We  begin  with  eqn.  (14), 

(z*|w*,a ,fiQ)  %  N(w*a,n0).  (14) 

Thus,  the  posterior  distribution  of  (a.Og)  is 


p(a,nQ|z*,w*) 


p*(a  ,«Q) 


exp 


|(-H)trn"1 


(17) 


where 


H  =  (z*-w*a)(z*-w*a) '  , 
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To  simplify,  complete  the  square  in  a  to  get 


p(a | z ,w)  «  exp  \  (-  H)[(a-a)'F(a-o)3 


where: 


-1. 


N 


a  s  F" 1  b ,  b  =  Z  w*'  (-)_1z*,  F 
2  J  v  J 


N  , 

Z  w*'(-)  w* 
2  J  v  j 


That  is,  a  posteriori,  in  large  samples. 


a  *  N(a,F_1 )  . 


(16) 


The  only  unfinished  item  remaining  in  this  large  sample  approximation 

is  to  show  that  the  latent  roots  of  4»'G_1*  go  to  zero  with  increasing 

sample  size.  The  matrix  4>'G~^«>  =  (r . . ) ,  where 

*  3 

r. .  =  (wta-z*) 'G~\w*a-z*)  . 
ij  i  i  j  J 

But 


w*  =  (wf' 
(Nq*h)  . 


•  >wjj’ ) '  =  (  rQ  0  Iq)w  , 
(NxN) 


where  w  only  changes  in  dimension  with  increasing  N.  But 

-h 

orthogonal  matrix  each  of  whose  elements  is  of  order  N  .  So 
of  order  N"\  So  its  latent  roots  must  vanish  as  N  -*■  ~. 


r.  .  is 
1J 


Remark  (1): 

We  note  that  since  Z^  *  Z  +  (N-l)A,  as  N  gets  large,  Z^  becomes 
very  large,  so  z*  is  less  and  less  informative  as  N  -*■  <*>.  As  a  result, 
ignoring  this  observation  is  no  great  loss  in  large  samples. 
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Large  Sample  Approximation 


Let  ♦=  (w|a-z*,. . .  ,wj|a-z*).  Then,  eqn.  (15)  becomes 
p(a|z,w)  «  |G  +  ♦♦Tv/2  «  jl  +  4>'G"14>rv/2  , 


p(a|z,w)  «  exp  |  (-  j)  1og|lN_^  +  4>*G_1*| 


Let  (A.  ,...,A^  denote  the  latent  roots  of  $'G~  4>,  and  let 
D  =  dlag.  (A. , . . .  »A^_-j ) .  Then 


p(a|z,w)  «  exp|-  j  log|IN  ^  +  Djj 

f  N_1  1 

■  exp  |  -  j  log  H  (1+A.)  > 

f  N*1 

*  exp  <  -  j  Z  log  (1+A  ) l 
1*1  1  J 


It  will  be  shown  shortly  that  *  A.,  decreases  with  sample  size,  N.  Thus, 
for  N  sufficiently  large,  |A,.|<<1,  s0  that  log  (1+A^)  =  A.  .  Then, 


p(a|z,w)  «  exp  |  -  j  J  *i 


=  exp  J  -  Tj-  tr(*'G  ♦  ) 


exp  -r^CwJa-zJJ'G^twJa-z*) 


Each  term  in  the  exponent  is  a  quadratic  form  in  a.  Combining  terms  gives 


p(a|z,w)  «  exp  j  -teJla'U  w*'(^-)  1wj)a 


i+  ( 1 2j'!!r'2j)  •  • 
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That  is,  the  prior  density  of  a  is  vague,  and  the  prior  density  of  l2 
is  inverted  Wishart.  Note  that  (G,n0)  are  assumed  to  be  known  hyper¬ 
parameters.  The  posterior  density  now  becomes 


-psHrr^B+G) 

e 


The  marginal  posterior  density  of  a  is  found  by  integrating  the  joint 
density  of  (a,I2)  with  respect  to  Because  of  the  known  form  of  the 

inverted  Wishart  density,  we  readily  effect  the  required  integration  and 
fi  nd 

p(a|z,w)  « 

where  v=N+nn-q-2.  The  posterior  density  in  eqn.  (15)  is  in  the 
matrix-T  family,  but  is  quite  complicated  analytically.  It  could  always 
be  evaluated  mimerrcal 1  y,  of  course,  but  we  seek  instead  a  large  sample 
approximation.  An  alternative  approach  will  be  developed  for  obtaining 
simple  Bayesian  results  in  small  samples. 


(z*|w*,a ,r1 )  -V  NfwJa.Z^, 

(z* . z*)  are  independent  and 

(z*|w*,a,zj  -v  N(w*a,l  ), 

for  all  j=2,...,N.  A  fundamental  difficulty  at  this  point  is  that  E^ 
depends  on  the  sample  size  N  (since  =  E  +  ( N-l ) A ) .  To  circumvent 
this  difficulty  we  will  seek  a  Bayesian  solution  to  our  problem  which 
ignores  one  data  point,  namely,  z*,  and  then  we  will  seek  a  large  sample 
solution,  so  that  the  loss  of  the  one  data  point  will  be  irrelevant. 

Accordingly,  we  consider  the  joint  posterior  density 

„  „  p'(a,E2)  -teJtrE^B 

p(a,Z2|z,w)  «  (N-l )/2  e 
I  ^2  ■ 

where  p'(a,Z2)  denotes  the  joint  prior  density  of  a  and  Z2> 


B  =  Z  (w*a  -  z*)(w*a  -  z*)'  , 

j — 2  w  J  J  J 


z  =  (z*  ...,z*),  w  =  (w*  ..,w*) 


It  is  interesting  to  note  that  the  sample  covariance  among  the  (z*, 
vectors  follows  a  non-central  Wishart  distribution. 

Adopt  the  prior  density 


p*(a,E2)  =  p'(a)  p£(l2)  , 
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where 


The  posterior  density  of  a  is  found  by  first  reducing  (13)  to  canonical 
forms;  then  adopting  a  prior  for  the  canonical  form  parameters,  and 
finally  applying  Bayes  theorem. 

Define  the  orthogonal  matrix  r  =  Tg  9  1^,  where  Fg  denotes  an 
orthogonal  matrix  of  order  N  whose  first  row  has  equal  elements.  Then 
it  is  strai ghtforward  to  check  (see  Press,  1979b,  Theorem  5)  that  if 

fig  =  rfir'  , 

fig  is  block  diagonal  of  the  form 


E1  =  Z  +  ( N- 1 )a ,  Z2  =  Z-A.  Accordingly,  define  z*  -  rz,  w*  *  rw.  Then, 

(z*|w*,a,fig)  -v  N(w*a,fig)  .  (14) 

We  now  view  eqn.  (14)  as  the  canonical  form  of  the  problem  and  adopt 
(z^,z2,a)  as  the  canonical  parameter  set.  Equivalently,  if 

z*  =  (z*',...,z*')',  w*  =  (w*',...,w*')', 

(Nqxl)  (lxq)  (lxq)  (Nqxh)  (hxq)  (hxq) 

the  canonical  problem  is  the  following: 


12 


That  is,  conditional  on  n,  a  posteriori,  and  adopting  a  vague  prior 
on  a,  a  is  normally  distributed,  centered  at  the  MLE,  with  precision 
matrix  (W'fi_1W). 

We  remark  in  passing  that  a(n)  is  the  same  estimator  found  from 
a  frequentist  point  of  view  in  Press  (1979a). 

3.2  Large  Sample  Estimator 

One  approximate  large  sample  Bayesian  estimator  of  a  may  be  found 
(when  0  is  unknown)  by  using  the  result  obtained  conditional  on  fi, 

and  then  replacing  fi  by  a  consistent  estimator.  This  approach  follows 
the  spirit  used  in  the  frequentist  analysis. 

Suppose  h  is  a  consistent  estimator  of  fi  (for  unknown  fi).  Then, 
the  approximate  posterior  distribution  of  a  is 

(a|z,w,fl)  -  N[a(n).(w’n‘1wr1]  • 

A  consistent  estimator,  n,  is  developed  in  Press,  1979a.  Thus,  in 
large  samples, 

a(n)  =  a(fl), 

and  a  is  approximately  normally  distributed. 

3.3  Marginal  Distribution  of  a 

In  this  subsection  we  find  Bayesian  estimators  based  upon  the  marginal 
posterior  distribution  of  a.  The  likelihood  in  (9)  is  equivalent  to 


(z|W,a,fi)  *  N(wa,n), 


(13) 
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3.1  Known  Covariance  Matrix 

From  (7)  and  (8)  it  follows  that  under  the  assumption  of  normality  on  u, 
the  density  of  the  response  vector  (likelihood  funct.  .r)  given  the  parameters 
and  explanatory  variables,  is 


p(2|M.a.n)  „  -I;-  e-M2-na)'n'1(2-H«)  _ 
Ini'5' 


(9) 


Hence,  if  we  adopt  a  vague  prior  for  a  (assuming  n  is  known),  its  density 
is  given  by 

p ( a )  «  constant. 


so  that  the  posterior  density  is  given  (from  Bayes  theorem)  by 
p(a|z,u,n)  «  e-H(z-Wa ) '  n  (z-Wa)  ^ 


(10) 


Note  that  we  are  using  the  common  Bayesian  convention  of  using  the  symbol 
p(’)  to  denote  a  generic  density  --  the  densities  differ  from  one  another 
according  to  the  arguments  and  conditioning  variables  used. 

Define  the  generalized  least  squares  (and  maximum  likelihood) 
estimator 


a(n)  =  (w'n^w)"1  w'n^z  . 


(11) 


Completing  the  square  in  the  exponent  in  (10)  shows  that 


p(a|z,w,n)  «  e-?s[a-a(n)](w  £i  W)[a-a(n)] 


so  that 


(a[z,w,n)  %  N[5(n),  (w'n^w)'1]  . 


(12) 
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and 

W  =  (W?,...,W),  u  =  Z  u 
(Nqxh)  1  n  (Nqxl)  j-2  J 


The  transformed  error  vector  in  (7)  satisfies  • 


E ( u )  -  0,  var(u)  =  0 


where 

n-1 

r  =  E  (I.), 
j-2  J 


n-1 

A  =  E  (A.)  . 

j-2  3 


(8) 


J 


3.  Bayesian  Inference 

In  this  section  we  examine  the  unknown  coefficient  vector  in  the  ir 

model  defined  by  (7)  and  (8),  from  the  Bayesian  point  of  view.  Four  v 

c 

different  approaches  will  be  taken.  First  we  will  examine  the  coefficient  ; 

m 

vector  conditional  on  the  error  covariance  matrix.  Then,  we  will  develop 
an  approximate  conditional  Bayesian  estimator  which  is  useful  when  samples 
are  large.  This  approach  ignores  the  intraclass  structure  of  the  covariance 

i 

matrix  and  is  useful  for  cases  where  the  intraclass  structure  cannot  be 
assumed.  Next,  in  subsection  3,  we  will  use  the  intraclass  covariance 
structure  when  we  develop  the  marginal  posterior  distribution  of  the  coef¬ 
ficients.  The  result  is  compl icated, and  so  a  large  sample  solution  is  found. 

In  the  final  subsection  we  develop  a  result  which  is  useful  in  small 
sampl es . 
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4.  Conclusions  and  Summary 

The  qualitative  controlled  feedback  process  of  forming  group  judgments 
and  making  decisions  has  been  examined  from  a  Bayesian  viewpoint.  The 
group  responses  to  many  questions  was  modeled  as  an  autoregressive  process 
with  coefficient  vector  a. 

It  was  shown  that  if  the  error  ~ovariance  matrix,  n,  is  known,  the 
posterior  distribution  of  a  is  normal,  and  centered  at  the  generalized 
LSE.  In  large  samples,  if  fl  is  unknown,  a  consistent  estimator  may  be 
used  to  make  conditional  inferences  about  a. 

Bayesian  inferences  can  also  be  made  marginally,  without  reference 
to  fl.  Assuming  intraclass  covariance  structure,  the  marginal  posterior 
distribution  of  a  was  shown  to  be,  approximately,  a  complicated  member 


of  the  matrix  T  family  of  distributions.  We  developed  a  normal  distri¬ 
bution  approximation  which  is  very  useful  in  large  samples,  however.  For 
small  sample  situations  involving  the  intraclass  covariance  structured 
situation  we  developed  a  posterior  multivariate  Student  t-density  for  a. 
This  result  although  useful  for  many  situations  is  somewhat  restrictive 
in  the  types  of  prior  information  it  will  accommodate.  A  more  general 
result  was  obtained  using  generalized  inverted  Wishart  distribution 
priors.  The  result  is  more  complicated  to  use,  however. 

Finally,  note  that  the  entire  QCF  process  is  subjective  in  nature. 

It  is  therefore  not  surprising  that  inferences  about  the  relationship 

between  the  responses  of  the  panel  members,  and  their  individual  charac- 

\ 

teristics  and  judgmental  behavior  regarding  the  reasons  other  panelists 
give,  would  depend  heavily  upon  the  nature  and  quantity  of  the  prior 
information  available. 


■  »  •  _  •  .  *  *'•  »  •  _  •  -*»  .%  *•  .  i  .  •  .  »  ’• 
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